Discriminatory Association Analysis on Semi-structured Data

نویسندگان

  • Binh Luong Thanh
  • Franco Turini
چکیده

Data mining has been applied to the discovery of illegally discriminatory treatments caused by protected-by-law attributes such as race, gender, age, etc. In this paper, we propose an improvement for the previous work of exploring discrimination in semi-structured business data. The main idea is that discrimination represented in the form of association rules is judged by opposite patterns whose components are almost the same except a single sensitive attribute and the decision (admission to school, acceptance to a position, etc.) However, the previous work requires numerous efforts and it has not been modeled in a systematic way. In order to solve this limitation, semantic analysis is integrated in the discrimination mining process showing better results in comparison with the previous work in experimental outcome.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Association Analysis of Semi-structured Data for Discrimination Discovery in Business

Data mining techniques have taken a critical role in life in numerous domains such as consumer analytics, finance, banking, medicine, biology, and astronomy... Recently, data mining techniques have found their application also in discovering illegal discriminatory treatment on the bases of sensitive attributes such as race, color, religion, nationality, gender, age... In this paper, we propose ...

متن کامل

Survey on Mining in Semi-Structured Data

Emerging technologies of semi-structured data have attracted wide attention of networks, e-commerce, information retrieval and databases. In these applications, the data are modeled not as static collections but as transient data streams, where the data source is an unbounded stream of individual data items. It is becoming increasingly popular to send heterogeneous and ill-structured data throu...

متن کامل

Discovering Association Rules in Semi-structured Data Sets

The discovery of association rules is one of the classic problems of data mining. Typically, it is done over well-structured data, such as databases. In this paper, we present a method of discovery of association rules in semi-structured data, namely, in a set of conceptual graphs. The method is based on conceptual clustering of the data and constructing of a conceptual hierarchy. A feature of ...

متن کامل

Mining Association Rules from Semi-Structured Data

Despite the growing popularity of semi-structured data such as Web documents, most knowledge discovery research has focused on databases containing well structured data. In this paper, we try to find useful information from semistructured data. In our approach, we begin by representing semi-structured data in a prototype-based approach. We then detect the most typical common structure of semist...

متن کامل

Designing a decision support system to predict the success of research centers with discriminatory analysis DEA

Research centers have an important place in promoting science and technology nationwide. On the other hand, given the limitations in allocating the funds and the facilities needed to establish these centers, it is important to decide on the selection of priority centers. In this decision - making process, several factors, such as requirements, priorities and strategies, capabilities, and balanc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011